A Critique of PT-Symmetric Quantum Mechanics 



Ali Mostafazadeh* 



Department of Mathematics, Kog University, 
Rumelifeneri Yolu, 34450 Sariyer, 
Istanbul, Turkey 



Abstract 

We study the physical content of the PT-symmetric complex extension of quantum 
mechanics as proposed in Bender et al, Phys. Rev. Lett. 80, 5243 (1998) and 89, 270401 
(2002), and show that as a fundamental probabilistic physical theory it is neither an 
alternative to nor an extension of ordinary quantum mechanics. We demonstrate that 
the definition of a physical observable given in the above papers is inconsistent with the 
dynamical aspect of the theory and offer a consistent notion of an observable. 

PACS numbers: 03.65-w, 11.30.Er 

The past five years have witnessed a great deal of research activity on the subject of VT- 
symmetric quantum mechanics. This was mainly initiated by Bender and Boettcher's demon- 
stration pQ that the spectrum of the Hamiltonians: 



was actually real, positive, and discrete. Here the operators p and x are defined according to 

(fnp)(x) = —id x ijj(x), (xip)(x) = xip(x), and for v > 2 the eigenvalue problem for H is defined by 

imposing vanishing boundary conditions on an appropriate contour C in the complex x-plane 
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PQ. The interest in the properties of the Hamiltonians (0) was boosted by the more recent 
findings of Bender, Brody, and Jones [2] who showed that a (unitary) probabilistic formulation 
of PT-symmetric quantum mechanics based on the Hamiltonians $1} was possible. 

The initial announcement of the results regarding the reality of the spectrum of the Hamil- 
tonians (0) led to a great deal of surprise and controversy, for such a Hamiltonian was ap- 
parently "non-Hermitian" . Bender and his collaborators pQ argued that the unusual spectral 
properties of these Hamiltonians was due to their PT-symmetry: [H,VT] = 0, where P and 
T stand for the "parity" and "time-reversal" operators defined by (Pip)(x) := x) and 
(Tif))(x) := ip(x)*, respectively. They also showed that the eigenf unctions 4> n of H, i.e., the 
solutions of the Schrodinger equation 



fulfilling vanishing boundary conditions on the contour C were orthogonal with respect to 
both the indefinite PT-inner product: (ip,4>) '■= J c dx [PTi/)(x)]<fi(x), and the positive-definite 
CPT-inner product [2|: 



where C is the so-called charge-conjugation operator defined through its "position-representation" 
according to C(x,y) = £) n 0»(aO0 n (y). 

The main assertion of |2j is that not only the PT-symmetric Hamiltonians (0) can be used 
to define the real energy levels of a quantum system, but that they are also capable of defining a 
unitary time-evolution provided that one adopts the CPT-inner product (jSJ) to define a Hilbert 
space structure on the space of state vectors. It is this latter observation that has raised 
the expectations of a number of theoretical physicists to consider the PT-symmetric quantum 
mechanics as a possible alternative to or an extension of the ordinary quantum mechanics 
(QM). The aim of this letter is to show that these expectations do not have a valid ground and 
that indeed a consistent probabilistic PT-symmetric quantum theory is doomed to reduce to 
ordinary QM. 
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We begin our analysis by elaborating on a number of ambiguities that have so far obscured 
the physical content of the PT-symmetric quantum mechanics. 

It is often claimed that unlike Hermiticity the PT-symmetry is a physical requirement, for 
it means symmetry under space-time reflections [2j. This argument rests on the assumption 
(illegally imported from the ordinary QM) that the variable x appearing in Eq. (J2J) is to be 
associated with the position of a particle, i.e., a point in the physical configuration space. In 
QM the argument x of a wave function does not have an a priori physical meaning. It is 
merely a (generalized) eigenvalue of a linear operator x. x inherits a physical meaning from x 
which represents a physical observable called the position. This in turn relies on the postulates 
that in QM physical observables are self-adjoint (Hermitian) operators acting in the space of 
state vectors of the system, that the latter is a separable Hilbert space (or an appropriately 
extended Dirac space), and that there is a well-defined measurement theory that describes how 
one should associate the theoretical predictions with the experimental data. Even these alone 
are not sufficient to relate the variable x to a point in the physical space unless one enforces 
Dirac's canonical quantization program and makes a connection between the observable x and 
the position of the corresponding classical particle. 

This argument shows that before being able to identify the values of the variable x of Eq. (J2J) 
with points of the physical space or viewing V as the usual parity (space-reflection) operator, 
one must first formulate the PT-symmetric quantum mechanics as a physical theory. This 
entails addressing the following questions: 1. What is the mathematical nature of the space of 
state vectors? 2. What are the observables? 4. How are the observables measured? 4. How 
does the theory relate to known theories? Alternatively, is there a correspondence principle 
that would, for example, justify calling the operator x of Eq. (JTJ) the "position operator" . 

As far as the results of Refs. [TJ |2| are concerned, x is just the independent variable of 
the solutions of a certain eigenvalue problem for the differential operator H. Nevertheless, it 
is a fact that this eigenvalue problem defines an associated complex vector space V, namely 
the span of the eigenfunctions of H, and that this vector space can be endowed with an ap- 
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propriate (positive-definite) inner product and made into a separable Hilbert space (through 
Cauchy completion [3 ). It is important to note that before constructing this Hilbert space one 
cannot decide whether the operator H is Hermitian or not. This also raises the issue of certain 
ambiguity in the terminology used in |2] . 

Often, one uses the terms "Hermitian" and "self-adjoint" synonymously for operators A 
that act in a Hilbert space /C and satisfy the defining relation: 

{^A<t>) = {A^<t>), (4) 

where ^,<j> G K, are arbitrary and (•, •) denotes the inner product of /C. It is a simple result 
of linear algebra that the matrix representation of a self-adjoint operator A in an orthonormal 
basis is a Hermitian matrix, i.e., if A satisfies (j3J), then the matrix elements A^ of A in any 
orthonormal basis satisfy Aij = A* { . This is the origin of the use of the term "Hermitian" for a 
self-adjoint operator A (that by definition satisfies (@J).) Apart from the technical issues related 
to the domain of A, there is no danger of using this terminology in ordinary QM where the 
Hilbert space has a fixed inner product and one works with orthonormal bases that are usually 
formed out of the eigenvectors of the relevant (commuting) observables. 

The situation is quite different in PT-symmetric QM where a priori neither the inner prod- 
uct nor the observables are fixed. As a result, one must refrain from calling the Hamiltonians 
"non-Hermitian" and referring to C(x,y) as the "position representation" of the operator C. In 
fact, as the general results reported in j3] and the particular constructions given in [2j show, 
the Hamiltonians are "Hermitian" with respect to some inner product after all. 

The use of the term "non-Hermitian" for these and similar Hamiltonians P] |2| stem from a 
rather naive definition of a Hermitian operator according to which a linear operator A is called 
a "Hermitian operator" if its matrix representation in the x-representation is a Hermitian 
(infinite) matrix, A(x, y)* = A(y,x). Unlike the definition of a Hermitian operator based on 
the condition (Jlj) . this definition suffers from the fact that it is manifestly basis-dependent. In 
fact, it is not difficult to see that the operator x is not self-adjoint with respect to the CVT- 
inner product. Therefore, the x-basis is not an orthonormal basis and the above-mentioned 
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correspondence between the Hermiticity (self-adjointness) of an operator (as defined by (jlj) 
and the Hermiticity of its matrix representation does not hold in the x-representation. This 
shows that taking A(x,y)* = A(y,x) as the definition of a "Hermitian operator" is quite 
misleading. 

The same problem arises when one defines the notion of a "symmetric operator" j2j through 
the requirement that the matrix elements of an operator A in the ^-representation form a 
symmetric matrix, A(x,y) = A(y,x). In particular, the proposal of identifying physical ob- 
servables with "symmetric" CPT-invariant linear operators outlined in [2j is ill-defined unless 
a prescription is provided that fixes a basis so that one can determine whether an operator 
admits a symmetric matrix representation in this basis. As we argued above the choice of the 
^-representation used in [2] cannot be motivated by physical considerations. 

Next, we wish to return to the problem of determining the physical observables for the VT- 
symmetric quantum systems defined by the Hamiltonians Having used the inner product 
(jHI) to endow the vector space V with the structure of a Hilbert space 7i, we ensure that the 
time-evolution determined by the time-dependent Schrodinger equation, id t ip{t) = Hip(t), is 
unitary. If we wish to adopt the same measurement theory as in the ordinary QM, we are forced 
to define the observables as Hermitian operators acting in 7i. This definition differs from the 
one given in [2j that identifies the observables with symmetric CPT-invariant operators. We 
will first explore the consequences of our definition of the observables and then show using a 
finite-dimensional toy model that indeed the definition given in [2] is inconsistent. 

The identification of the observables with Hermitian operators acting in the Hilbert space 
7i immediately excludes the operators x and p as candidates for physical observables, for unlike 
the Hamiltonian H they fail to be Hermitian with respect to the inner product (jSJ) of 7i. In 
particular, they cannot be identified as the quantum analogs of the position and momentum 
of a classical particle. This in turn leads to the natural question: What are the position and 
momentum operators in the "PT-symmetric quantum mechanics? 

In order to respond to this question, we recall a well-known mathematical result about 
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Hilbert spaces, namely that up to isomorphisms there is a unique infinite- dimensional separable 
Hilbert space [3J Theorem II. 7]. This implies that 7i may be mapped onto the Hilbert space 
L 2 (R) by a unitary linear transformation, i.e., there exists a linear map U : 7i — > L 2 (IR) 
satisfying 

0M)+ = (£AW>, (5) 

where if), <f> £ T~i are arbitrary and (-|-) is the usual L 2 -inner product. 

Equation (JHJ) suggests a direct method of constructing physical observables for the VT- 
symmetric quantum systems associated with the Hamiltonians (JTJ. These have the general 
form O = U~ l o U where 6 is a Hermitian operator acting in L 2 (M). For example the position 
and momentum observables are respectively described by X = U~ x x U and P = lA~ x p U where 
x and p are the position and momentum operators of ordinary QM. 

The uniqueness of the Hilbert space structure has another important consequence: One 
can describe the PT-symmetric systems defined by the Hilbert space 7i, the Hamiltonian H, 
and the observables O in terms of an ordinary quantum system having L 2 (M) as the Hilbert 
space, h := UHU' 1 as the Hamiltonian, and 6 = UOIA^ 1 as the observables. In this sense 
the PT-symmetric QM actually reduces to the ordinary QM. Therefore the claim that it is a 
fundamental physical theory extending QM is not valid. 

The operator U used in the above discussion is in general nonlocal: The corresponding 
similarity transformation maps differential operators to nonlocal (non-differential) operators. 
In particular, the Hamiltonian h will most probably not have the standard (kinetic + potential) 
form. The same nonlocal behavior is also shared by the observables O, in general, and by the 
position and momentum operators X and P, in particular. 

Because a closed-form expression for the operator C that enters in the expression for the 
inner product (J3J) is not available, one cannot obtain the explicit form of the operator U and 
consequently the Hamiltonian h and the observables X and P. One may however attempt to 
obtain approximate expressions for the latter using the series expansion method outlined in [3] . 
As pointed out in jE] there are finite-dimensional analogs of the "PT-symmetric Hamiltonians (0) 
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where the operator C is represented by a simple matrix. For these systems one may use the 
machinery of the theory of pseudo-Hermitian operators [7J EJ |Hl E| to construct the operators 
U and h explicitly JUj- in the following, we shall offer an independent re-examination of a 
two-dimensional model introduced in |2| to provide a concrete realization of our general results 
and to demonstrate the shortcomings of the definition of the observables given in [2|. 
Consider the matrix Hamiltonian [2] 

( re ie s \ 

H: =\ i6 ' ( 6 ) 

i s re I 

where r, s, 6 G K, s 7^ 0, and |rsin(#)/s| < 1. Let T be the operation of complex-conjugation 
of vectors ip G C 2 , V : = tr 1 , and C := sec a <7i + itana <r 3 , where are the Pauli matrices: 

°^{\\\ ff2 ^(!"o ! )- a ^{l-i)- (7) 

and a G (— 7r/2,7r/2) is defined by sina = rsin6/s. Then as shown in j2], is PT- and 
C-symmetric and has real eigenvalues: e± = r cos #±scos a. Furthermore, H is Hermitian with 
respect to the CVT- inner product: 

foM) + :=(C7TV)-& (8) 

where ip,<p E C 2 , a dot means the ordinary dot product: ip ■ <p := X^=i an d the subscript 
i labels the components of the corresponding two-dimensional complex vector. Clearly, if is a 
two-dimensional analog of the PT-symmetric Hamiltonians (JIJ. 

According to j2] , the Hilbert space Ti. of the PT-symmetric Hamiltonian (jUJ) is obtained by 
endowing C 2 with the CVT- inner product, and the physical observables are symmetric matrices 
commuting with CVT. The latter have the general form 

O = acr + (b + c sin ot)a\ + (c — b sin a)a 3 . (9) 

where do stands for the 2x2 identity matrix and a, b, c are arbitrary real parameters. Note 
that here the problem of the basis-dependence of the notion of a "symmetric operator" has 
been avoided, because a choice for a basis of C 2 , namely the standard basis {( J ), ( ° )}, is 



made and all the relevant operators are described by their matrix representations in this basis. 
The latter is however not an orthonormal basis with respect to the CPT-inner product (JSJ), and 
there is no physical reason for choosing it over the other bases of C 2 . 

Next, we give an equivalent quantum description of the above system using a self-adjoint 
Hamiltonian h acting in the ordinary two-dimensional Hilbert space C 2 endowed with the 
Euclidean inner product: (ifj\4>) : = J^Li^i^i- We shall denote this Hilbert space by S. 

Introducing 

1 / e ia/2 e -W2 \ 
U '~ V2 cos a y -ie iQ / 2 ie ia ' 2 J ' ' 101 

we can easily check that indeed for all 6 C 2 , (■0,0)+ = (Uip\U(j)), i.e., U is a unitary 
operator mapping Ti, onto S. Furthermore, we have 

h := UHU' 1 = ( + \ — r cos# a + s cos a cr 3 , (11) 

V e / 

which is clearly a self-adjoint operator acting in S. It describes the dipole interaction of a 
nonrelativistic spin-half particle with a magnetic field aligned along the z-axis. The observables 
of this system are simply the spin operators s M = and their linear combinations with real 
coefficients. 

In our formulation, the observables O associated with the PT-symmetric description (TC, H) 
of the above system are obtained from the observables o of the ordinary quantum description 
{£ , h) via the similarity transformation defined by U. They have the general form: 

3 

fi=0 

where are real constants and S M := U~ l s^U. More specifically 
S = ]-<Jo, S 1 = —~a 2 , S 2 = - (tana o x - sec a a 3 ), 5*3 = -(sec a; o\ +tana cr 3 ). (13) 

£i £i £i Zi 

The physical system described by the PT-symmetric Hamiltonian H, the Hilbert space 7i, 
and the observables O can be more conveniently described in terms of the Hamiltonian h, the 
Hilbert space S, and the observables o = J2^=o c ^ -V ^ s we mentioned above, this system 
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corresponds to the dipole interaction of a nonrelativistic spin-half particle with a magnetic 
field. 

It is not difficult to see that indeed the observables (JHJ) constitute a three-parameter sub- 
family of the observables (j!2j) : the former are linear combinations of Sq, S2 and S3 with real 
coefficients. This shows that at least for the model ©, our definition of an observable is 
more general than the one given in [2|. We will next show that indeed the latter is physically 
unacceptable and that it leads to an explicit inconsistency in the Heisenberg picture. 

The restriction that the observables must be symmetric (or CPT-invariant) matrices ex- 
cludes the possibility of measuring Si. In physical terms it means that one cannot measure the 
spin of the particle along the x-axis. This is clearly an artificial restriction without any physical 
justification. It is also in conflict with the isotropy of the Euclidean space. Relaxing the (basis- 
dependent) requirement of symmetry and defining the observables as arbitrary CPT-invariant 
operators is also not viable for it violates the condition that the observables must have a real 
spectrum. 

For the PT-symmetric system defined by the Hamiltonian (JHJ), the Heisenberg-picture 
observables Oh are related to the Schrodinger-picture observables O according to Oh if) = 
e itHQ e -itH n OW; } e t Q = s 2 which according to (fTT?J) and (JHJ) is both symmetric and CVT- 
invariant, i.e., it is an observable in the sense of [2]. One can use the similarity transformations 
that relate H and S2 to h and S2 and the properties of the Pauli matrices to compute the 
Heisenberg-picture observable associated with S2. This yields 

On{t) = sin[2s cos(a)t] Si + cos[2s cos(a)t] 5*2. (14) 

Because Si is neither symmetric nor CPT-invariant, so is On{t) (except when t is an integer 
multiple of 7r/(2scosa) and On{t) = ±£2.) This shows that defining observables as symmetric 
CPT-invariant operators is inconsistent with the dynamical aspects of the theory; an observable 
becomes non-observable as soon as one turns on the dynamics! Our definition of observables 
as Hermitian operators acting in H, does not suffer from such an inconsistency. 

In summary, if one describes a physical system by a PT-symmetric Hamiltonian that gen- 

9 



erates a unitary time-evolution and if one enforces the standard rules of quantum measurement 
theory to acquire the information about the associated physical quantities, then one can de- 
vise an ordinary quantum mechanical description of the same system. In this sense, there is no 
physical motivation for developing PT-symmetric quantum mechanics. However, in the passage 
from the PT-symmetric to ordinary quantum description a standard Hamiltonian is generally 
mapped to a nonlocal operator while the opposite is true for the position and momentum 
operators. 

In spite of the difficulties with viewing PT-symmetric quantum mechanics as a genuine 
extension of ordinary QM, one must admit that its study has been quite rewarding. For in- 
stance, it has raised some fundamental issues such as the possibility of an essentially dynamical 
determination of the inner product of the Hilbert space [TJ HJ |2] and the formulation of the 
theory of pseudo-Hermitian Hamiltonians HJ |H1 E] that has applications in relativistic quan- 
tum mechanics [IT] and quantum cosmology A proper understanding of the role of VT- 
symmetric Hamiltonians in effective/phenomenological theories ^3] and especially the physical 
significance of the exceptional spectral points [Uj arising due to the spontaneous break-down 
of "PT-symmetry awaits further study. 

This work has been supported by the Turkish Academy of Sciences in the framework of the 
Young Researcher Award Program (EA-TUBA-GEBIP/2001-1-1). 
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